Angrist and Krueger (1995)

Split-Sample Instrumental Variables Estimates of the Return to Schooling
Notes: This paper uses two data sets:

    1. A 1980 census extract, also used in Angrist and Krueger (1991). Below you can download the ASCII file containing 329,509 observations on the following variables:

        * log weekly wage
        * quarter of birth (1-4)
        * year of birth (30-39)
        * place of birth (1980 census state codes)
        * education (highest grade completed)

    2. A CPS extract which contains 30,967 observations on men born 1944-53 from the 1979 and 1981-85 March CPS, matched to lottery number dummies for groups of 25 lottery numbers. Below you can download the Stata data set. There are 72 variables including all covariates used in the JBES article and in our NBER working paper. Follow the sample selection rules in the notes to the tables to reproduce the 25, 781 observation working sample. The CPS data were first used in Angrist and Krueger's unpublished 1992 NBER working paper <http://www.nber.org/papers/w4067>. These data were also used in Alberto Abadie's (2002) JASA paper.

Programs: Click the links to download the program files:

    * samplcps.do, a sample Stata program that analyzes the CPS data set.
    * samplcps.log, the log file produced by running samplcps.do using the CPS extract.
    * ssivex1.sas , a sample SAS program that uses the 1980 census extract (which you can download below) to produce some example SSIV estimates in the spirit of Tables 1 and 2. The regressions in the paper include covariates which are not in the data set posted here, so this will not replicate exactly the results in the paper.

Data: Click the links to download the data sets:

    * 1980 census extract from Angrist and Krueger (1991)
    * CPS extract

Data summary:

    * 1980 census extract summary

    * CPS extract summary 